Perturbation in Gci's and Speech Quality for Pitch Synchronous Synthesis

نویسندگان

  • Parveen K. Lehana
  • Prem C. Pandey
چکیده

In pitch synchronous speech synthesis the analysis/synthesis of the speech is done at each glottal closure instant (GCI). The errors in estimation of GCI's affect the quality of the synthesized speech. The effect of random perturbations in the GCI's, obtained from the speech and from glottal signal from an impedance electroglottograph using Childers and Hu's algorithm on the quality of speech synthesized using harmonic plus noise model (HNM), is investigated in this paper. Investigations show that the speech quality is very sensitive to positions of the GCI's. A small perturbation with maximum of 4 % of the local fundamental frequency considerably degrades the synthesized speech. Perturbations above 8 % severely affect quality of the out put speech. GCI's obtained from the glottal signal can afford slightly more perturbation as compared to the GCI's calculated from the speech signal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...

متن کامل

Dct Based Pitch Modification

In this paper, we propose a novel algorithm for pitch modification. The linear prediction residual is obtained from pitch synchronous frames by inverse filtering the speech signal. Then Discrete Cosine Transform (DCT) is applied on these pitch synchronous frames. Based on the desired factor of pitch modification, the dimension of the DCT vector is changed by truncation or zero padding, and then...

متن کامل

High-Quality Speech Modification Based on Pitch- Synchronous Harmonic and Non-harmonic Modeling of Speech

In this paper, we propose a high-quality speech modification method based on pitch-synchronous harmonic and non-harmonic modeling of speech. In the proposed method, the harmonic and non-harmonic parts of speech are modeled by the sum of sinusoids with frequencies corresponding to pitch multiples and with randomized frequencies, respectively. Then, harmonic and nonharmonic parts are synthesized ...

متن کامل

Speech Synthesis in Indian Languages

This paper presents the study of phonemes in the Indian languages for developing good quality speech synthesis. Harmonic plus noise model (HNM) which divides the speech signal in two sub bands: harmonic and noise, is implemented with the objective of studying its capabilities and to investigate the adaptation needed. Childers and Hu's algorithms are used for voicing and pitch detection. As the ...

متن کامل

Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis

We describe a corpus-based approach to improving synthesized speech quality and present two useful cost functions for unit selection. One is pitch-synchronous cross correlation for concatenation costs to reduce the noise caused by phase mismatch at concatenation points. The other is a discontinuous cost function for internal and concatenation costs to eliminate unnecessary cost calculation. An ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003